Streaming submodular maximization


Fairness in Streaming Submodular Maximization: Algorithms and Hardness

Neural Information Processing Systems

Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data. However, if datapoints have sensitive attributes such as gender or age, such machine learning algorithms, left unchecked, are known to exhibit bias: under- or over-representation of particular groups. This has made the design of fair machine learning algorithms increasingly important. In this work we address the question: Is it possible to create fair summaries for massive datasets? To this end, we develop the first streaming approximation algorithms for submodular maximization under fairness constraints, for both monotone and non-monotone functions. We validate our findings empirically on exemplar-based clustering, movie recommendation, DPP-based summarization, and maximum coverage in social networks, showing that fairness constraints do not significantly impact utility.
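
The fairness notion in this line of work is typically expressed as per-group bounds: a summary of at most k items must contain between a lower and an upper number of items from each group. Below is a minimal sketch of the streaming-side feasibility check under that assumed formulation; the names (extendable, group, upper) are illustrative, not taken from the paper.

# Illustrative only: assumes the common per-group-bounds form of the
# fairness constraint; names are hypothetical.
def extendable(S, e, group, upper, k):
    """When element e arrives, adding it must respect its group's upper
    bound and the overall budget k. Also meeting the per-group lower
    bounds is the delicate part in streaming, and is what the paper's
    algorithms and hardness results address."""
    g = group[e]
    return len(S) < k and sum(1 for x in S if group[x] == g) < upper[g]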


Review for NeurIPS paper: Fairness in Streaming Submodular Maximization: Algorithms and Hardness

Neural Information Processing Systems

Additional Feedback: As I mentioned before, I would encourage the authors to change the initial framing of the paper, which promises to "giv[e] affirmative answers" to the question of whether "it is possible to create fair summaries for massive datasets". I think this part of the abstract might be better served by describing the fairness constraint, so that the reader better understands this aspect. There also seems to be some odd formatting with bold and numbered text in the first paragraph of Section 6.
2. On Line 22, the sentence "submodularity is a natural way to capture the diminishing returns property of set functions, which holds for a variety of machine learning problems" is slightly awkward and misleading as written. It reads as if all set functions have an inherent diminishing-returns property and that submodularity captures this. Of course, fixing this is just a matter of rephrasing.


Fairness in Streaming Submodular Maximization over a Matroid Constraint

Halabi, Marwa El, Fusco, Federico, Norouzi-Fard, Ashkan, Tardos, Jakab, Tarnawski, Jakub

arXiv.org Artificial Intelligence

Streaming submodular maximization is a natural model for the task of selecting a representative subset from a large-scale dataset. If datapoints have sensitive attributes such as gender or race, it becomes important to enforce fairness to avoid bias and discrimination. This has spurred significant interest in developing fair machine learning algorithms. Recently, such algorithms have been developed for monotone submodular maximization under a cardinality constraint. In this paper, we study the natural generalization of this problem to a matroid constraint. We give streaming algorithms as well as impossibility results that provide trade-offs between efficiency, quality and fairness. We validate our findings empirically on a range of well-known real-world applications: exemplar-based clustering, movie recommendation, and maximum coverage in social networks.
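
Since the constraint here is a matroid, the algorithms interact with it only through an independence oracle. As a concrete example (purely illustrative, not the paper's construction), the sketch below implements a partition matroid, which allows at most cap[p] elements from each part p.

# A standard concrete matroid, given for illustration; any matroid
# exposing an is_independent oracle fits the same interface.
class PartitionMatroid:
    def __init__(self, part, cap):
        self.part = part  # maps element -> part id
        self.cap = cap    # maps part id -> capacity of that part

    def is_independent(self, S):
        counts = {}
        for e in S:
            p = self.part[e]
            counts[p] = counts.get(p, 0) + 1
            if counts[p] > self.cap[p]:  # part over capacity
                return False
        return True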


Do Less, Get More: Streaming Submodular Maximization with Subsampling

Feldman, Moran, Karbasi, Amin, Kazemi, Ehsan

Neural Information Processing Systems

In this paper, we develop the first one-pass streaming algorithm for submodular maximization that does not evaluate the entire stream even once. By carefully subsampling each element of the data stream, our algorithm enjoys the tightest approximation guarantees in various settings while having the smallest memory footprint and requiring the lowest number of function evaluations. More specifically, for a monotone submodular function and a $p$-matchoid constraint, our randomized algorithm achieves a $4p$ approximation ratio (in expectation) with $O(k)$ memory and $O(km/p)$ queries per element ($k$ is the size of the largest feasible solution and $m$ is the number of matroids used to define the constraint). To the best of our knowledge, our algorithm is the first that combines the benefits of streaming and subsampling in a novel way in order to truly scale submodular maximization to massive machine learning problems. To showcase its practicality, we empirically evaluated the performance of our algorithm on a video summarization application and observed that it outperforms the state-of-the-art algorithm by up to fifty-fold while maintaining practically the same utility.
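
To make the subsampling idea concrete, here is a minimal sketch restricted to a plain cardinality constraint (|S| <= k) rather than the paper's full p-matchoid algorithm: each arriving element is dropped with probability 1 - q before any oracle call, and survivors are handled by a standard swapping rule. The value oracle f, the sampling rate q, and the swap threshold are assumptions for illustration.

import random

def sample_streaming(stream, f, k, q=0.5):
    """Sketch: subsampled one-pass streaming maximization of a monotone
    submodular f under |S| <= k. Most elements cost zero oracle queries."""
    S, weight = [], {}  # solution and the marginal gain each kept element had
    for e in stream:
        if random.random() > q:       # subsample: skip without querying f
            continue
        gain = f(S + [e]) - f(S)      # marginal value of e w.r.t. current S
        if len(S) < k:
            if gain > 0:
                S.append(e)
                weight[e] = gain
        else:
            worst = min(S, key=weight.get)
            if gain >= 2 * weight[worst]:   # swap only on a big improvement
                S.remove(worst)
                del weight[worst]
                S.append(e)
                weight[e] = gain
    return S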